Picture for Xiao Chen

Xiao Chen

SpeechEditBench: A Bilingual Multi-Attribute Benchmark for Instruction-Guided Speech Editing

Add code
Jun 01, 2026
Viaarxiv icon

Skill-Conditioned Gated Self-Distillation for LLM Reasoning

Add code
May 27, 2026
Viaarxiv icon

Imagine2Real: Towards Zero-shot Humanoid-Object Interaction via Video Generative Priors

Add code
May 21, 2026
Viaarxiv icon

CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs

Add code
May 11, 2026
Viaarxiv icon

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

Add code
Mar 31, 2026
Viaarxiv icon

GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning

Add code
Mar 24, 2026
Viaarxiv icon

Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models

Add code
Mar 10, 2026
Viaarxiv icon

RADAR: Benchmarking Vision-Language-Action Generalization via Real-World Dynamics, Spatial-Physical Intelligence, and Autonomous Evaluation

Add code
Feb 11, 2026
Viaarxiv icon

FedAdaVR: Adaptive Variance Reduction for Robust Federated Learning under Limited Client Participation

Add code
Jan 29, 2026
Viaarxiv icon

PROST-LLM: Progressively Enhancing the Speech-to-Speech Translation Capability in LLMs

Add code
Jan 23, 2026
Viaarxiv icon